Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 55734 |
| Missing cells | 203028 |
| Missing cells (%) | 18.2% |
| Duplicate rows | 585 |
| Duplicate rows (%) | 1.0% |
| Total size in memory | 8.5 MiB |
| Average record size in memory | 160.0 B |
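The headline percentages above can be re-derived from counts printed elsewhere in this report. The sketch below is only a consistency check. It assumes the six variables flagged as "Missing" in the alerts are the only ones with missing cells, which the matching total supports.

```python
# Figures copied from the overview and alert tables of this report.
n_variables = 20
n_observations = 55734
duplicate_rows = 585
missing_per_variable = {
    "construcion_year": 22811,
    "construction_cost": 22811,
    "Renovation_Year": 41682,
    "renovation_cost": 41682,
    "reason_water_body_in_use_name2": 30662,
    "reason_water_body_in_use_name3": 43380,
}

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_cells = sum(missing_per_variable.values())

missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(missing_cells)
print(f"{missing_pct:.1f}%")
print(f"{duplicate_pct:.1f}%")
```

The summed per-variable counts should equal the "Missing cells" figure, and both percentages should round to the values in the table.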
Variable types
| Categorical | 11 |
|---|---|
| Boolean | 2 |
| Numeric | 7 |
Alerts
| State Name has constant value "KERALA" | Constant |
|---|---|
| Dataset has 585 (1.0%) duplicate rows | Duplicates |
| Original_Storage_Capacity is highly overall correlated with Present_Storage_Capacity | High correlation |
| Present_Storage_Capacity is highly overall correlated with Original_Storage_Capacity | High correlation |
| Reason_for_Water_Body_Use is highly overall correlated with Water_Body_Status | High correlation |
| Scheme_Status_Reason is highly overall correlated with Water_Body_Status and 2 other fields | High correlation |
| Water_Body_Status is highly overall correlated with Reason_for_Water_Body_Use and 4 other fields | High correlation |
| construcion_year is highly overall correlated with construction_cost | High correlation |
| construction_cost is highly overall correlated with construcion_year | High correlation |
| no_people_benefited_by_water_body is highly overall correlated with Water_Body_Status | High correlation |
| reason_water_body_in_use_name2 is highly overall correlated with Scheme_Status_Reason and 1 other field | High correlation |
| reason_water_body_in_use_name3 is highly overall correlated with Scheme_Status_Reason and 1 other field | High correlation |
| Area_Type is highly imbalanced (50.7%) | Imbalance |
| Water_Body_Type is highly imbalanced (79.8%) | Imbalance |
| Scheme_Status_Reason is highly imbalanced (67.7%) | Imbalance |
| Repair_Renovation_Status is highly imbalanced (93.2%) | Imbalance |
| construcion_year has 22811 (40.9%) missing values | Missing |
| construction_cost has 22811 (40.9%) missing values | Missing |
| Renovation_Year has 41682 (74.8%) missing values | Missing |
| renovation_cost has 41682 (74.8%) missing values | Missing |
| reason_water_body_in_use_name2 has 30662 (55.0%) missing values | Missing |
| reason_water_body_in_use_name3 has 43380 (77.8%) missing values | Missing |
| construction_cost is highly skewed (γ₁ = 119.5673849) | Skewed |
| renovation_cost is highly skewed (γ₁ = 89.24662845) | Skewed |
| Original_Storage_Capacity is highly skewed (γ₁ = 109.9465346) | Skewed |
| Present_Storage_Capacity is highly skewed (γ₁ = 124.6175812) | Skewed |
| no_people_benefited_by_water_body is highly skewed (γ₁ = 114.8187724) | Skewed |
| construction_cost has 766 (1.4%) zeros | Zeros |
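The "Skewed" alerts report a skewness coefficient. As a rough illustration of what that number measures, the sketch below computes the population (Fisher-Pearson) skewness: the third central moment divided by the variance raised to the power 1.5. The helper name `skewness` and the toy samples are illustrative only. The report's own values likely come from a bias-corrected estimator, so this simpler form would not reproduce them exactly.

```python
def skewness(values):
    """Population skewness: third central moment over variance ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # variance
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Toy samples (not taken from the dataset).
symmetric = [1, 2, 3, 4, 5]
right_tailed = [1, 1, 1, 1, 100]

print(skewness(symmetric))
print(skewness(right_tailed))
```

A symmetric sample gives zero, while a single large value pulls the coefficient positive. This is the pattern behind the very large coefficients for the cost and capacity columns, where a handful of extreme records dominate.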
Reproduction
| Analysis started | 2023-12-11 11:28:40.176706 |
|---|---|
| Analysis finished | 2023-12-11 11:29:14.307554 |
| Duration | 34.13 seconds |
| Software version | ydata-profiling v4.6.2 |
| Download configuration | config.json |
Area_Type
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 435.6 KiB |
| Rural | 49725 |
|---|---|
| Urban | 6009 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 278670 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Rural |
|---|---|
| 2nd row | Rural |
| 3rd row | Rural |
| 4th row | Rural |
| 5th row | Rural |
Common Values
| Value | Count | Frequency (%) |
| Rural | 49725 | 89.2% |
| Urban | 6009 | 10.8% |
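The imbalance percentages in this report (50.7% for Area_Type, 79.8% for Water_Body_Type) are consistent with one minus the normalised Shannon entropy of the class counts. The sketch below rests on that assumption about the scoring rule. The function name `imbalance_score` is illustrative, and the check covers only these two columns.

```python
import math

def imbalance_score(counts):
    """1 - H / log2(k): 0 for a uniform split, approaching 1 as one class dominates."""
    if len(counts) < 2:
        raise ValueError("need at least two classes")
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(counts))

# Class counts copied from the Common Values tables.
area_type = imbalance_score([49725, 6009])
water_body_type = imbalance_score([51007, 3349, 848, 463, 63, 4])

print(f"{100 * area_type:.1f}%")
print(f"{100 * water_body_type:.1f}%")
```

Under this definition a 50/50 split scores 0, so the 89.2% / 10.8% split of Area_Type lands just over the halfway mark.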
Words
| Value | Count | Frequency (%) |
| rural | 49725 | 89.2% |
| urban | 6009 | 10.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 55734 | 20.0% |
| a | 55734 | 20.0% |
| R | 49725 | 17.8% |
| u | 49725 | 17.8% |
| l | 49725 | 17.8% |
| U | 6009 | 2.2% |
| b | 6009 | 2.2% |
| n | 6009 | 2.2% |
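The character-level counts above follow directly from the value counts: each character of a value is counted once per row that holds that value. This short sketch rebuilds the character and Unicode-category totals for Area_Type from the two counts in the Common Values table, using the standard library's `unicodedata.category`, where `Ll` is a lowercase letter and `Lu` an uppercase letter.

```python
from collections import Counter
import unicodedata

# Value counts copied from the Common Values table for Area_Type.
value_counts = {"Rural": 49725, "Urban": 6009}

char_counts = Counter()
category_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        # Each character occurs once in every row holding this value.
        char_counts[ch] += count
        category_counts[unicodedata.category(ch)] += count

total_chars = sum(char_counts.values())

for ch, n in char_counts.most_common():
    print(ch, n, f"{100 * n / total_chars:.1f}%")
print(dict(category_counts), total_chars)
```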
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 222936 | 80.0% |
| Uppercase Letter | 55734 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 55734 | 25.0% |
| a | 55734 | 25.0% |
| u | 49725 | 22.3% |
| l | 49725 | 22.3% |
| b | 6009 | 2.7% |
| n | 6009 | 2.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 49725 | 89.2% |
| U | 6009 | 10.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 278670 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 55734 | 20.0% |
| a | 55734 | 20.0% |
| R | 49725 | 17.8% |
| u | 49725 | 17.8% |
| l | 49725 | 17.8% |
| U | 6009 | 2.2% |
| b | 6009 | 2.2% |
| n | 6009 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 278670 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 55734 | 20.0% |
| a | 55734 | 20.0% |
| R | 49725 | 17.8% |
| u | 49725 | 17.8% |
| l | 49725 | 17.8% |
| U | 6009 | 2.2% |
| b | 6009 | 2.2% |
| n | 6009 | 2.2% |
State Name
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 435.6 KiB |
| KERALA | 55734 |
|---|---|
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 334404 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | KERALA |
|---|---|
| 2nd row | KERALA |
| 3rd row | KERALA |
| 4th row | KERALA |
| 5th row | KERALA |
Common Values
| Value | Count | Frequency (%) |
| KERALA | 55734 | 100.0% |
Words
| Value | Count | Frequency (%) |
| kerala | 55734 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 111468 | |
| K | 55734 | |
| E | 55734 | |
| R | 55734 | |
| L | 55734 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 334404 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 111468 | |
| K | 55734 | |
| E | 55734 | |
| R | 55734 | |
| L | 55734 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 334404 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 111468 | |
| K | 55734 | |
| E | 55734 | |
| R | 55734 | |
| L | 55734 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 334404 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 111468 | |
| K | 55734 | |
| E | 55734 | |
| R | 55734 | |
| L | 55734 |
District Name
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 435.6 KiB |
| Kozhikode | 6192 |
|---|---|
| Palakkad | 5988 |
| Malappuram | 5983 |
| Kannur | 5314 |
| Thrissur | 5023 |
| Other values (9) | 27234 |
Length
| Max length | 18 |
|---|---|
| Median length | 14 |
| Mean length | 8.6672049 |
| Min length | 6 |
Characters and Unicode
| Total characters | 483058 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Kollam |
|---|---|
| 2nd row | Kollam |
| 3rd row | Kollam |
| 4th row | Kollam |
| 5th row | Palakkad |
Common Values
| Value | Count | Frequency (%) |
| Kozhikode | 6192 | 11.1% |
| Palakkad | 5988 | 10.7% |
| Malappuram | 5983 | 10.7% |
| Kannur | 5314 | 9.5% |
| Thrissur | 5023 | 9.0% |
| Ernakulam | 4416 | 7.9% |
| Alappuzha | 4239 | 7.6% |
| Idukki | 3792 | 6.8% |
| Kottayam | 3506 | 6.3% |
| Kasargod | 2880 | 5.2% |
| Other values (4) | 8401 | 15.1% |
Words
| Value | Count | Frequency (%) |
| kozhikode | 6192 | 11.1% |
| palakkad | 5988 | 10.7% |
| malappuram | 5983 | 10.7% |
| kannur | 5314 | 9.5% |
| thrissur | 5023 | 9.0% |
| ernakulam | 4416 | 7.9% |
| alappuzha | 4239 | 7.6% |
| idukki | 3792 | 6.8% |
| kottayam | 3506 | 6.3% |
| kasargod | 2880 | 5.2% |
| Other values (4) | 8401 | 15.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 95914 | |
| u | 34065 | 7.1% |
| r | 33937 | 7.0% |
| k | 30168 | 6.2% |
| l | 25330 | 5.2% |
| n | 23742 | 4.9% |
| h | 23666 | 4.9% |
| p | 23093 | 4.8% |
| o | 21122 | 4.4% |
| d | 20795 | 4.3% |
| Other values (17) | 151226 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 427324 | |
| Uppercase Letter | 55734 | 11.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 95914 | |
| u | 34065 | 8.0% |
| r | 33937 | 7.9% |
| k | 30168 | 7.1% |
| l | 25330 | 5.9% |
| n | 23742 | 5.6% |
| h | 23666 | 5.5% |
| p | 23093 | 5.4% |
| o | 21122 | 4.9% |
| d | 20795 | 4.9% |
| Other values (9) | 95492 |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 20244 | |
| T | 7672 | 13.8% |
| P | 7445 | 13.4% |
| M | 5983 | 10.7% |
| E | 4416 | 7.9% |
| A | 4239 | 7.6% |
| I | 3792 | 6.8% |
| W | 1943 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 483058 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 95914 | |
| u | 34065 | 7.1% |
| r | 33937 | 7.0% |
| k | 30168 | 6.2% |
| l | 25330 | 5.2% |
| n | 23742 | 4.9% |
| h | 23666 | 4.9% |
| p | 23093 | 4.8% |
| o | 21122 | 4.4% |
| d | 20795 | 4.3% |
| Other values (17) | 151226 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 483058 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 95914 | |
| u | 34065 | 7.1% |
| r | 33937 | 7.0% |
| k | 30168 | 6.2% |
| l | 25330 | 5.2% |
| n | 23742 | 4.9% |
| h | 23666 | 4.9% |
| p | 23093 | 4.8% |
| o | 21122 | 4.4% |
| d | 20795 | 4.3% |
| Other values (17) | 151226 |
Water_Body_Type
Categorical
IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 435.6 KiB |
| Ponds | 51007 |
|---|---|
| Water consv schemes/percolation tanks/check-dams | 3349 |
| Tank | 848 |
| Others | 463 |
| Reservoirs | 63 |
Length
| Max length | 48 |
|---|---|
| Median length | 5 |
| Mean length | 7.5825708 |
| Min length | 4 |
Characters and Unicode
| Total characters | 422607 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ponds |
|---|---|
| 2nd row | Ponds |
| 3rd row | Ponds |
| 4th row | Ponds |
| 5th row | Ponds |
Common Values
| Value | Count | Frequency (%) |
| Ponds | 51007 | 91.5% |
| Water consv schemes/percolation tanks/check-dams | 3349 | 6.0% |
| Tank | 848 | 1.5% |
| Others | 463 | 0.8% |
| Reservoirs | 63 | 0.1% |
| Lakes | 4 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| ponds | 51007 | 77.5% |
| water | 3349 | 5.1% |
| consv | 3349 | 5.1% |
| schemes/percolation | 3349 | 5.1% |
| tanks/check-dams | 3349 | 5.1% |
| tank | 848 | 1.3% |
| others | 463 | 0.7% |
| reservoirs | 63 | 0.1% |
| lakes | 4 | < 0.1% |
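The lowercased table above counts words rather than whole values, which is why its percentages differ from the Common Values table. The sketch below reproduces it under an assumed tokenisation rule inferred from the rows shown: lowercase each value, then split on whitespace only, so `schemes/percolation` and `tanks/check-dams` stay intact.

```python
from collections import Counter

# Value counts copied from the Common Values table for Water_Body_Type.
value_counts = {
    "Ponds": 51007,
    "Water consv schemes/percolation tanks/check-dams": 3349,
    "Tank": 848,
    "Others": 463,
    "Reservoirs": 63,
    "Lakes": 4,
}

word_counts = Counter()
for value, count in value_counts.items():
    # Assumed rule: lowercase, split on whitespace; punctuation is kept.
    for word in value.lower().split():
        word_counts[word] += count

total_words = sum(word_counts.values())
share = {w: round(100 * c / total_words, 1) for w, c in word_counts.items()}

for word, count in word_counts.most_common():
    print(word, count, f"{share[word]}%")
```

Because the four-word value contributes four tokens per row, the denominator is the total word count rather than the number of observations.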
Most occurring characters
| Value | Count | Frequency (%) |
| s | 68345 | |
| n | 61902 | |
| o | 61117 | |
| d | 54356 | |
| P | 51007 | |
| e | 17338 | 4.1% |
| c | 16745 | 4.0% |
| a | 14248 | 3.4% |
| t | 10510 | 2.5% |
| (space) | 10047 | 2.4% |
| Other values (15) | 56992 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 346779 | |
| Uppercase Letter | 55734 | 13.2% |
| Space Separator | 10047 | 2.4% |
| Other Punctuation | 6698 | 1.6% |
| Dash Punctuation | 3349 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 68345 | |
| n | 61902 | |
| o | 61117 | |
| d | 54356 | |
| e | 17338 | 5.0% |
| c | 16745 | 4.8% |
| a | 14248 | 4.1% |
| t | 10510 | 3.0% |
| k | 7550 | 2.2% |
| r | 7287 | 2.1% |
| Other values (6) | 27381 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 51007 | |
| W | 3349 | 6.0% |
| T | 848 | 1.5% |
| O | 463 | 0.8% |
| R | 63 | 0.1% |
| L | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 10047 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 6698 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3349 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 402513 | |
| Common | 20094 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 68345 | |
| n | 61902 | |
| o | 61117 | |
| d | 54356 | |
| P | 51007 | |
| e | 17338 | 4.3% |
| c | 16745 | 4.2% |
| a | 14248 | 3.5% |
| t | 10510 | 2.6% |
| k | 7550 | 1.9% |
| Other values (12) | 39395 |
Common
| Value | Count | Frequency (%) |
| (space) | 10047 | 50.0% |
| / | 6698 | 33.3% |
| - | 3349 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 422607 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 68345 | |
| n | 61902 | |
| o | 61117 | |
| d | 54356 | |
| P | 51007 | |
| e | 17338 | 4.1% |
| c | 16745 | 4.0% |
| a | 14248 | 3.4% |
| t | 10510 | 2.5% |
| (space) | 10047 | 2.4% |
| Other values (15) | 56992 |
Water_Body_Status
Boolean
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 54.6 KiB |
| True | 46550 |
|---|---|
| False | 9184 |
| Value | Count | Frequency (%) |
| True | 46550 | 83.5% |
| False | 9184 | 16.5% |
Reason_for_Water_Body_Use
Categorical
HIGH CORRELATION 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 435.6 KiB |
| Irrigation | 20038 |
|---|---|
| Domestic/Drinking | 10192 |
| Not Specified | 9184 |
| Ground water recharge | 6199 |
| Religious | 3591 |
| Other values (4) | 6530 |
Length
| Max length | 21 |
|---|---|
| Median length | 17 |
| Mean length | 12.821617 |
| Min length | 5 |
Characters and Unicode
| Total characters | 714600 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Domestic/Drinking |
|---|---|
| 2nd row | Domestic/Drinking |
| 3rd row | Not Specified |
| 4th row | Domestic/Drinking |
| 5th row | Not Specified |
Common Values
| Value | Count | Frequency (%) |
| Irrigation | 20038 | 36.0% |
| Domestic/Drinking | 10192 | 18.3% |
| Not Specified | 9184 | 16.5% |
| Ground water recharge | 6199 | 11.1% |
| Religious | 3591 | 6.4% |
| Pisciculture | 2663 | 4.8% |
| Other | 2312 | 4.1% |
| Recreation | 1295 | 2.3% |
| Industrial | 260 | 0.5% |
Words
| Value | Count | Frequency (%) |
| irrigation | 20038 | 25.9% |
| domestic/drinking | 10192 | 13.2% |
| not | 9184 | 11.9% |
| specified | 9184 | 11.9% |
| ground | 6199 | 8.0% |
| water | 6199 | 8.0% |
| recharge | 6199 | 8.0% |
| religious | 3591 | 4.6% |
| pisciculture | 2663 | 3.4% |
| other | 2312 | 3.0% |
| Other values (2) | 1555 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 103083 | |
| r | 81594 | |
| e | 58313 | 8.2% |
| t | 52143 | 7.3% |
| o | 50499 | 7.1% |
| n | 48176 | 6.7% |
| g | 40020 | 5.6% |
| a | 33991 | 4.8% |
| c | 32196 | 4.5% |
| (space) | 21582 | 3.0% |
| Other values (19) | 193003 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 607716 | |
| Uppercase Letter | 75110 | 10.5% |
| Space Separator | 21582 | 3.0% |
| Other Punctuation | 10192 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 103083 | |
| r | 81594 | |
| e | 58313 | |
| t | 52143 | |
| o | 50499 | |
| n | 48176 | |
| g | 40020 | 6.6% |
| a | 33991 | 5.6% |
| c | 32196 | 5.3% |
| s | 16706 | 2.7% |
| Other values (9) | 90995 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 20384 | |
| I | 20298 | |
| N | 9184 | |
| S | 9184 | |
| G | 6199 | 8.3% |
| R | 4886 | 6.5% |
| P | 2663 | 3.5% |
| O | 2312 | 3.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 21582 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 10192 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 682826 | |
| Common | 31774 | 4.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 103083 | |
| r | 81594 | |
| e | 58313 | 8.5% |
| t | 52143 | 7.6% |
| o | 50499 | 7.4% |
| n | 48176 | 7.1% |
| g | 40020 | 5.9% |
| a | 33991 | 5.0% |
| c | 32196 | 4.7% |
| D | 20384 | 3.0% |
| Other values (17) | 162427 |
Common
| Value | Count | Frequency (%) |
| (space) | 21582 | 67.9% |
| / | 10192 | 32.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 714600 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 103083 | |
| r | 81594 | |
| e | 58313 | 8.2% |
| t | 52143 | 7.3% |
| o | 50499 | 7.1% |
| n | 48176 | 6.7% |
| g | 40020 | 5.6% |
| a | 33991 | 4.8% |
| c | 32196 | 4.5% |
| (space) | 21582 | 3.0% |
| Other values (19) | 193003 |
Scheme_Status_Reason
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 435.6 KiB |
| No reported problems | 46550 |
|---|---|
| Others | 4577 |
| Siltation | 2126 |
| Destroyed beyond repair | 1326 |
| Dried-up | 642 |
| Other values (3) | 513 |
Length
| Max length | 27 |
|---|---|
| Median length | 20 |
| Mean length | 18.281175 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1018883 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No reported problems |
|---|---|
| 2nd row | No reported problems |
| 3rd row | Siltation |
| 4th row | No reported problems |
| 5th row | Others |
Common Values
| Value | Count | Frequency (%) |
| No reported problems | 46550 | 83.5% |
| Others | 4577 | 8.2% |
| Siltation | 2126 | 3.8% |
| Destroyed beyond repair | 1326 | 2.4% |
| Dried-up | 642 | 1.2% |
| Salinity | 287 | 0.5% |
| Construction | 183 | 0.3% |
| Due to industrial effluents | 43 | 0.1% |
Words
| Value | Count | Frequency (%) |
| no | 46550 | 30.7% |
| reported | 46550 | 30.7% |
| problems | 46550 | 30.7% |
| others | 4577 | 3.0% |
| siltation | 2126 | 1.4% |
| destroyed | 1326 | 0.9% |
| beyond | 1326 | 0.9% |
| repair | 1326 | 0.9% |
| dried-up | 642 | 0.4% |
| salinity | 287 | 0.2% |
| Other values (5) | 355 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 150302 | |
| r | 149073 | |
| o | 144837 | |
| (space) | 95881 | 9.4% |
| p | 95068 | |
| t | 57487 | 5.6% |
| s | 52722 | 5.2% |
| d | 49887 | 4.9% |
| l | 49049 | 4.8% |
| b | 47876 | 4.7% |
| Other values (15) | 126701 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 866626 | |
| Space Separator | 95881 | 9.4% |
| Uppercase Letter | 55734 | 5.5% |
| Dash Punctuation | 642 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 150302 | |
| r | 149073 | |
| o | 144837 | |
| p | 95068 | |
| t | 57487 | 6.6% |
| s | 52722 | 6.1% |
| d | 49887 | 5.8% |
| l | 49049 | 5.7% |
| b | 47876 | 5.5% |
| m | 46550 | 5.4% |
| Other values (8) | 23775 | 2.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 46550 | |
| O | 4577 | 8.2% |
| S | 2413 | 4.3% |
| D | 2011 | 3.6% |
| C | 183 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 95881 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 642 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 922360 | |
| Common | 96523 | 9.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 150302 | |
| r | 149073 | |
| o | 144837 | |
| p | 95068 | |
| t | 57487 | 6.2% |
| s | 52722 | 5.7% |
| d | 49887 | 5.4% |
| l | 49049 | 5.3% |
| b | 47876 | 5.2% |
| N | 46550 | 5.0% |
| Other values (13) | 79509 |
Common
| Value | Count | Frequency (%) |
| (space) | 95881 | 99.3% |
| - | 642 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1018883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 150302 | |
| r | 149073 | |
| o | 144837 | |
| (space) | 95881 | 9.4% |
| p | 95068 | |
| t | 57487 | 5.6% |
| s | 52722 | 5.2% |
| d | 49887 | 4.9% |
| l | 49049 | 4.8% |
| b | 47876 | 4.7% |
| Other values (15) | 126701 |
Water_Body_Nature
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 435.6 KiB |
| Man-made | 32923 |
|---|---|
| Natural | 22811 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.5907166 |
| Min length | 7 |
Characters and Unicode
| Total characters | 423061 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Natural |
|---|---|
| 2nd row | Natural |
| 3rd row | Natural |
| 4th row | Natural |
| 5th row | Natural |
Common Values
| Value | Count | Frequency (%) |
| Man-made | 32923 | 59.1% |
| Natural | 22811 | 40.9% |
Words
| Value | Count | Frequency (%) |
| man-made | 32923 | 59.1% |
| natural | 22811 | 40.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 111468 | |
| M | 32923 | 7.8% |
| n | 32923 | 7.8% |
| - | 32923 | 7.8% |
| m | 32923 | 7.8% |
| d | 32923 | 7.8% |
| e | 32923 | 7.8% |
| N | 22811 | 5.4% |
| t | 22811 | 5.4% |
| u | 22811 | 5.4% |
| Other values (2) | 45622 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 334404 | |
| Uppercase Letter | 55734 | 13.2% |
| Dash Punctuation | 32923 | 7.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 111468 | |
| n | 32923 | 9.8% |
| m | 32923 | 9.8% |
| d | 32923 | 9.8% |
| e | 32923 | 9.8% |
| t | 22811 | 6.8% |
| u | 22811 | 6.8% |
| r | 22811 | 6.8% |
| l | 22811 | 6.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 32923 | |
| N | 22811 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 32923 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 390138 | |
| Common | 32923 | 7.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 111468 | |
| M | 32923 | 8.4% |
| n | 32923 | 8.4% |
| m | 32923 | 8.4% |
| d | 32923 | 8.4% |
| e | 32923 | 8.4% |
| N | 22811 | 5.8% |
| t | 22811 | 5.8% |
| u | 22811 | 5.8% |
| r | 22811 | 5.8% |
Common
| Value | Count | Frequency (%) |
| - | 32923 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 423061 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 111468 | |
| M | 32923 | 7.8% |
| n | 32923 | 7.8% |
| - | 32923 | 7.8% |
| m | 32923 | 7.8% |
| d | 32923 | 7.8% |
| e | 32923 | 7.8% |
| N | 22811 | 5.4% |
| t | 22811 | 5.4% |
| u | 22811 | 5.4% |
| Other values (2) | 45622 |
construcion_year
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 177 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 22811 |
| Missing (%) | 40.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1987.9537 |
| Minimum | 1519 |
|---|---|
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 435.6 KiB |
Quantile statistics
| Minimum | 1519 |
|---|---|
| 5-th percentile | 1940 |
| Q1 | 1980 |
| median | 1995 |
| Q3 | 2005 |
| 95-th percentile | 2016 |
| Maximum | 2020 |
| Range | 501 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 29.667765 |
|---|---|
| Coefficient of variation (CV) | 0.01492377 |
| Kurtosis | 41.113718 |
| Mean | 1987.9537 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | -4.254521 |
| Sum | 65449401 |
| Variance | 880.17625 |
| Monotonicity | Not monotonic |
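Several of the statistics above are simple functions of one another, which makes a quick cross-check possible. The sketch below recomputes the range, IQR, coefficient of variation and variance for construcion_year from the rounded figures in the tables, and recovers the number of non-missing rows from the sum and the mean. The variable names are illustrative, and small differences in the last digits are expected because the inputs are already rounded.

```python
# Figures copied from the construcion_year tables above.
minimum, q1, q3, maximum = 1519, 1980, 2005, 2020
mean = 1987.9537
std = 29.667765
total = 65449401           # "Sum" row
n_observations = 55734
n_missing = 22811

value_range = maximum - minimum      # Range
iqr = q3 - q1                        # Interquartile range
cv = std / mean                      # Coefficient of variation
variance = std ** 2                  # Variance
non_missing = round(total / mean)    # rows that actually carry a year

print(value_range, iqr)
print(f"{cv:.8f}", f"{variance:.3f}")
print(non_missing, n_observations - n_missing)
```

The recovered row count should match the observations minus the missing values, confirming that the mean and sum are taken over non-missing rows only.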
| Value | Count | Frequency (%) |
| 1990 | 2773 | 5.0% |
| 1980 | 2390 | 4.3% |
| 2000 | 2384 | 4.3% |
| 1995 | 1255 | 2.3% |
| 1970 | 1138 | 2.0% |
| 1985 | 1127 | 2.0% |
| 2010 | 1021 | 1.8% |
| 2015 | 878 | 1.6% |
| 1998 | 851 | 1.5% |
| 2005 | 800 | 1.4% |
| Other values (167) | 18306 | |
| (Missing) | 22811 |
| Value | Count | Frequency (%) |
| 1519 | 9 | |
| 1520 | 3 | < 0.1% |
| 1565 | 1 | < 0.1% |
| 1569 | 1 | < 0.1% |
| 1580 | 1 | < 0.1% |
| 1600 | 2 | < 0.1% |
| 1618 | 1 | < 0.1% |
| 1619 | 3 | < 0.1% |
| 1620 | 1 | < 0.1% |
| 1650 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2020 | 27 | < 0.1% |
| 2019 | 174 | 0.3% |
| 2018 | 697 | |
| 2017 | 709 | |
| 2016 | 682 | |
| 2015 | 878 | |
| 2014 | 777 | |
| 2013 | 537 | |
| 2012 | 552 | |
| 2011 | 281 | 0.5% |
construction_cost
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 1092 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 22811 |
| Missing (%) | 40.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1013942.6 |
| Minimum | 0 |
|---|---|
| Maximum | 7.6 × 10⁹ |
| Zeros | 766 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 435.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 300 |
| Q1 | 5000 |
| median | 20000 |
| Q3 | 75000 |
| 95-th percentile | 800000 |
| Maximum | 7.6 × 10⁹ |
| Range | 7.6 × 10⁹ |
| Interquartile range (IQR) | 70000 |
Descriptive statistics
| Standard deviation | 51358566 |
|---|---|
| Coefficient of variation (CV) | 50.652342 |
| Kurtosis | 16256.15 |
| Mean | 1013942.6 |
| Median Absolute Deviation (MAD) | 19000 |
| Skewness | 119.56738 |
| Sum | 3.3382032 × 10¹⁰ |
| Variance | 2.6377023 × 10¹⁵ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 2353 | 4.2% |
| 5000 | 2031 | 3.6% |
| 50000 | 2029 | 3.6% |
| 20000 | 1450 | 2.6% |
| 100000 | 1344 | 2.4% |
| 1000 | 1245 | 2.2% |
| 25000 | 1206 | 2.2% |
| 15000 | 1178 | 2.1% |
| 2000 | 1103 | 2.0% |
| 30000 | 990 | 1.8% |
| Other values (1082) | 17994 | |
| (Missing) | 22811 |
| Value | Count | Frequency (%) |
| 0 | 766 | |
| 1 | 8 | < 0.1% |
| 2 | 6 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 4 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 36 | 0.1% |
| 12 | 2 | < 0.1% |
| 15 | 1 | < 0.1% |
| 20 | 30 | 0.1% |
| Value | Count | Frequency (%) |
| 7600000000 | 1 | |
| 4420000000 | 1 | |
| 1200000000 | 1 | |
| 1075700000 | 1 | |
| 1042000000 | 1 | |
| 1000000000 | 1 | |
| 930000000 | 1 | |
| 871200000 | 1 | |
| 800000000 | 1 | |
| 610000000 | 1 |
Renovation_Year
Real number (ℝ)
MISSING 
| Distinct | 68 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 41682 |
| Missing (%) | 74.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2008.9353 |
| Minimum | 1949 |
|---|---|
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 435.6 KiB |
Quantile statistics
| Minimum | 1949 |
|---|---|
| 5-th percentile | 1989 |
| Q1 | 2003 |
| median | 2013 |
| Q3 | 2017 |
| 95-th percentile | 2018 |
| Maximum | 2020 |
| Range | 71 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 10.639445 |
|---|---|
| Coefficient of variation (CV) | 0.0052960616 |
| Kurtosis | 4.721049 |
| Mean | 2008.9353 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -1.8692308 |
| Sum | 28229559 |
| Variance | 113.19779 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2018 | 1926 | 3.5% |
| 2015 | 1397 | 2.5% |
| 2017 | 1284 | 2.3% |
| 2016 | 1132 | 2.0% |
| 2010 | 921 | 1.7% |
| 2000 | 912 | 1.6% |
| 2014 | 574 | 1.0% |
| 2005 | 540 | 1.0% |
| 2012 | 515 | 0.9% |
| 2019 | 500 | 0.9% |
| Other values (58) | 4351 | 7.8% |
| (Missing) | 41682 |
| Value | Count | Frequency (%) |
| 1949 | 1 | < 0.1% |
| 1950 | 32 | |
| 1951 | 1 | < 0.1% |
| 1952 | 5 | < 0.1% |
| 1954 | 1 | < 0.1% |
| 1955 | 1 | < 0.1% |
| 1956 | 1 | < 0.1% |
| 1958 | 3 | < 0.1% |
| 1959 | 2 | < 0.1% |
| 1960 | 38 |
| Value | Count | Frequency (%) |
| 2020 | 33 | 0.1% |
| 2019 | 500 | 0.9% |
| 2018 | 1926 | |
| 2017 | 1284 | |
| 2016 | 1132 | |
| 2015 | 1397 | |
| 2014 | 574 | 1.0% |
| 2013 | 341 | 0.6% |
| 2012 | 515 | 0.9% |
| 2011 | 197 | 0.4% |
renovation_cost
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 490 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 41682 |
| Missing (%) | 74.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 179471.2 |
| Minimum | 1 |
|---|---|
| Maximum | 5.5 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 435.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1500 |
| Q1 | 5000 |
| median | 10000 |
| Q3 | 50000 |
| 95-th percentile | 400000 |
| Maximum | 5.5 × 10⁸ |
| Range | 5.5 × 10⁸ |
| Interquartile range (IQR) | 45000 |
Descriptive statistics
| Standard deviation | 5215352.3 |
|---|---|
| Coefficient of variation (CV) | 29.05955 |
| Kurtosis | 8977.782 |
| Mean | 179471.2 |
| Median Absolute Deviation (MAD) | 8000 |
| Skewness | 89.246628 |
| Sum | 2.5219293 × 10⁹ |
| Variance | 2.71999 × 10¹³ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 1670 | 3.0% |
| 10000 | 1486 | 2.7% |
| 50000 | 782 | 1.4% |
| 20000 | 617 | 1.1% |
| 100000 | 542 | 1.0% |
| 3000 | 535 | 1.0% |
| 15000 | 497 | 0.9% |
| 2000 | 479 | 0.9% |
| 8000 | 477 | 0.9% |
| 6000 | 475 | 0.9% |
| Other values (480) | 6492 | 11.6% |
| (Missing) | 41682 |
| Value | Count | Frequency (%) |
| 1 | 2 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 10 | 6 | |
| 20 | 8 | |
| 30 | 1 | < 0.1% |
| 50 | 6 | |
| 60 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 550000000 | 1 | |
| 180000000 | 1 | |
| 170000000 | 1 | |
| 69600000 | 1 | |
| 60000000 | 1 | |
| 55000000 | 1 | |
| 53500000 | 1 | |
| 30000000 | 1 | |
| 29000000 | 1 | |
| 18000000 | 1 |
Repair_Renovation_Status
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 54.6 KiB |
| False | 55281 |
|---|---|
| True | 453 |
| Value | Count | Frequency (%) |
| False | 55281 | 99.2% |
| True | 453 | 0.8% |
Original_Storage_Capacity
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 2212 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 172467.46 |
| Minimum | 1 |
|---|---|
| Maximum | 1.6979 × 10⁹ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 435.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 30 |
| Q1 | 120 |
| median | 350 |
| Q3 | 1500 |
| 95-th percentile | 172467.46 |
| Maximum | 1.6979 × 10⁹ |
| Range | 1.6979 × 10⁹ |
| Interquartile range (IQR) | 1380 |
Descriptive statistics
| Standard deviation | 11803863 |
|---|---|
| Coefficient of variation (CV) | 68.441102 |
| Kurtosis | 13579.805 |
| Mean | 172467.46 |
| Median Absolute Deviation (MAD) | 290 |
| Skewness | 109.94653 |
| Sum | 9.6123015 × 10⁹ |
| Variance | 1.3933119 × 10¹⁴ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 172467.4606 | 3812 | 6.8% |
| 120 | 1909 | 3.4% |
| 200 | 1821 | 3.3% |
| 60 | 1779 | 3.2% |
| 300 | 1688 | 3.0% |
| 100 | 1436 | 2.6% |
| 240 | 1300 | 2.3% |
| 80 | 1203 | 2.2% |
| 40 | 1147 | 2.1% |
| 600 | 1133 | 2.0% |
| Other values (2202) | 38506 |
| Value | Count | Frequency (%) |
| 1 | 12 | < 0.1% |
| 2 | 21 | < 0.1% |
| 3 | 24 | < 0.1% |
| 4 | 27 | < 0.1% |
| 5 | 19 | < 0.1% |
| 6 | 40 | 0.1% |
| 7 | 28 | 0.1% |
| 8 | 27 | < 0.1% |
| 9 | 54 | 0.1% |
| 10 | 290 |
| Value | Count | Frequency (%) |
| 1697900000 | 1 | |
| 1459490000 | 1 | |
| 1089800000 | 1 | |
| 708200000 | 1 | |
| 504920000 | 1 | |
| 454140000 | 1 | |
| 446570000 | 1 | |
| 443230000 | 1 | |
| 215340000 | 1 | |
| 201000000 | 1 |
Present_Storage_Capacity
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 2424 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 142922.56 |
| Minimum | 1 |
|---|---|
| Maximum | 1.6979 × 10⁹ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 435.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 90 |
| median | 242 |
| Q3 | 1000 |
| 95-th percentile | 142922.56 |
| Maximum | 1.6979 × 10⁹ |
| Range | 1.6979 × 10⁹ |
| Interquartile range (IQR) | 910 |
Descriptive statistics
| Standard deviation | 10937592 |
|---|---|
| Coefficient of variation (CV) | 76.5281 |
| Kurtosis | 16997.421 |
| Mean | 142922.56 |
| Median Absolute Deviation (MAD) | 202 |
| Skewness | 124.61758 |
| Sum | 7.965646 × 10⁹ |
| Variance | 1.1963092 × 10¹⁴ |
| Monotonicity | Not monotonic |
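The derived figures in the Present_Storage_Capacity tables above can be cross-checked from the primary ones. The following is a minimal sketch, assuming the usual definitions (CV as standard deviation divided by mean, variance as the squared standard deviation, IQR as Q3 minus Q1, range as maximum minus minimum). The variable names are illustrative and the inputs are the rounded values printed in the report, so small rounding differences are expected.

```python
# Values copied from the Present_Storage_Capacity tables above (rounded as reported).
mean = 142922.56
std = 10937592
q1, q3 = 90, 1000
minimum, maximum = 1, 1.6979e9

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV       = {cv:.4f}")
print(f"Variance = {variance:.7e}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range:.4e}")
```

The recomputed values agree with the reported CV (76.5281), variance (1.1963092 × 10¹⁴), IQR (910) and range (1.6979 × 10⁹) to the precision shown.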
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 142922.5615 | 3812 | 6.8% |
| 100 | 2143 | 3.8% |
| 200 | 1652 | 3.0% |
| 150 | 1563 | 2.8% |
| 300 | 1334 | 2.4% |
| 50 | 1247 | 2.2% |
| 80 | 1067 | 1.9% |
| 400 | 899 | 1.6% |
| 120 | 883 | 1.6% |
| 60 | 863 | 1.5% |
| Other values (2414) | 40271 | 72.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 63 | 0.1% |
| 2 | 57 | 0.1% |
| 3 | 51 | 0.1% |
| 4 | 54 | 0.1% |
| 5 | 113 | 0.2% |
| 6 | 89 | 0.2% |
| 7 | 74 | 0.1% |
| 8 | 124 | 0.2% |
| 9 | 102 | 0.2% |
| 10 | 451 | 0.8% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1697900000 | 1 | < 0.1% |
| 1404000000 | 1 | < 0.1% |
| 1064050000 | 1 | < 0.1% |
| 476160000 | 1 | < 0.1% |
| 352500000 | 1 | < 0.1% |
| 320000000 | 1 | < 0.1% |
| 254000000 | 1 | < 0.1% |
| 222500000 | 1 | < 0.1% |
| 157600000 | 1 | < 0.1% |
| 111890000 | 1 | < 0.1% |
filled_up_storage_name
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 435.6 KiB |
| Full | 33236 |
|---|---|
| Upto 3/4 | 17233 |
| Upto 1/2 | 3781 |
| Upto 1/4 | 1045 |
| Nil/Negligible filled up | 439 |
Length
| Max length | 24 |
|---|---|
| Median length | 4 |
| Mean length | 5.7406969 |
| Min length | 4 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Full |
|---|---|
| 2nd row | Full |
| 3rd row | Full |
| 4th row | Full |
| 5th row | Upto 3/4 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Full | 33236 | 59.6% |
| Upto 3/4 | 17233 | 30.9% |
| Upto 1/2 | 3781 | 6.8% |
| Upto 1/4 | 1045 | 1.9% |
| Nil/Negligible filled up | 439 | 0.8% |
Plots omitted: Length, Common Values (Plot)
Words
| Value | Count | Frequency (%) |
|---|---|---|
| full | 33236 | 42.2% |
| upto | 22059 | 28.0% |
| 3/4 | 17233 | 21.9% |
| 1/2 | 3781 | 4.8% |
| 1/4 | 1045 | 1.3% |
| nil/negligible | 439 | 0.6% |
| filled | 439 | 0.6% |
| up | 439 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| l | 68667 | 21.5% |
| u | 33675 | 10.5% |
| F | 33236 | 10.4% |
| (space) | 22937 | 7.2% |
| / | 22498 | 7.0% |
| p | 22498 | 7.0% |
| U | 22059 | 6.9% |
| t | 22059 | 6.9% |
| o | 22059 | 6.9% |
| 4 | 18278 | 5.7% |
| Other values (10) | 31986 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 174226 | 54.5% |
| Uppercase Letter | 56173 | 17.6% |
| Decimal Number | 44118 | 13.8% |
| Space Separator | 22937 | 7.2% |
| Other Punctuation | 22498 | 7.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| l | 68667 | 39.4% |
| u | 33675 | 19.3% |
| p | 22498 | 12.9% |
| t | 22059 | 12.7% |
| o | 22059 | 12.7% |
| i | 1756 | 1.0% |
| e | 1317 | 0.8% |
| g | 878 | 0.5% |
| b | 439 | 0.3% |
| f | 439 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 18278 | 41.4% |
| 3 | 17233 | 39.1% |
| 1 | 4826 | 10.9% |
| 2 | 3781 | 8.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| F | 33236 | 59.2% |
| U | 22059 | 39.3% |
| N | 878 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 22937 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| / | 22498 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 230399 | 72.0% |
| Common | 89553 | 28.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| l | 68667 | 29.8% |
| u | 33675 | 14.6% |
| F | 33236 | 14.4% |
| p | 22498 | 9.8% |
| U | 22059 | 9.6% |
| t | 22059 | 9.6% |
| o | 22059 | 9.6% |
| i | 1756 | 0.8% |
| e | 1317 | 0.6% |
| N | 878 | 0.4% |
| Other values (4) | 2195 | 1.0% |
Common
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 22937 | 25.6% |
| / | 22498 | 25.1% |
| 4 | 18278 | 20.4% |
| 3 | 17233 | 19.2% |
| 1 | 4826 | 5.4% |
| 2 | 3781 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319952 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| l | 68667 | 21.5% |
| u | 33675 | 10.5% |
| F | 33236 | 10.4% |
| (space) | 22937 | 7.2% |
| / | 22498 | 7.0% |
| p | 22498 | 7.0% |
| U | 22059 | 6.9% |
| t | 22059 | 6.9% |
| o | 22059 | 6.9% |
| 4 | 18278 | 5.7% |
| Other values (10) | 31986 | 10.0% |
filled_up_storage_space_name
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 435.6 KiB |
| Filled up every year | 31424 |
|---|---|
| Usually filled up | 17373 |
| Rarely filled up | 5431 |
| Never filled up | 1506 |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 18.539976 |
| Min length | 15 |
Characters and Unicode
| Total characters | 1033307 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Filled up every year |
|---|---|
| 2nd row | Filled up every year |
| 3rd row | Filled up every year |
| 4th row | Filled up every year |
| 5th row | Usually filled up |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Filled up every year | 31424 | 56.4% |
| Usually filled up | 17373 | 31.2% |
| Rarely filled up | 5431 | 9.7% |
| Never filled up | 1506 | 2.7% |
Plots omitted: Length, Common Values (Plot)
Words
| Value | Count | Frequency (%) |
|---|---|---|
| filled | 55734 | 28.1% |
| up | 55734 | 28.1% |
| every | 31424 | 15.8% |
| year | 31424 | 15.8% |
| usually | 17373 | 8.7% |
| rarely | 5431 | 2.7% |
| never | 1506 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 158449 | 15.3% |
| l | 151645 | 14.7% |
| (space) | 142892 | 13.8% |
| y | 85652 | 8.3% |
| u | 73107 | 7.1% |
| r | 69785 | 6.8% |
| d | 55734 | 5.4% |
| p | 55734 | 5.4% |
| i | 55734 | 5.4% |
| a | 54228 | 5.2% |
| Other values (7) | 130347 | 12.6% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 834681 | 80.8% |
| Space Separator | 142892 | 13.8% |
| Uppercase Letter | 55734 | 5.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| e | 158449 | 19.0% |
| l | 151645 | 18.2% |
| y | 85652 | 10.3% |
| u | 73107 | 8.8% |
| r | 69785 | 8.4% |
| d | 55734 | 6.7% |
| p | 55734 | 6.7% |
| i | 55734 | 6.7% |
| a | 54228 | 6.5% |
| v | 32930 | 3.9% |
| Other values (2) | 41683 | 5.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| F | 31424 | 56.4% |
| U | 17373 | 31.2% |
| R | 5431 | 9.7% |
| N | 1506 | 2.7% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 142892 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 890415 | 86.2% |
| Common | 142892 | 13.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| e | 158449 | 17.8% |
| l | 151645 | 17.0% |
| y | 85652 | 9.6% |
| u | 73107 | 8.2% |
| r | 69785 | 7.8% |
| d | 55734 | 6.3% |
| p | 55734 | 6.3% |
| i | 55734 | 6.3% |
| a | 54228 | 6.1% |
| v | 32930 | 3.7% |
| Other values (6) | 97417 | 10.9% |
Common
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 142892 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 1033307 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| e | 158449 | 15.3% |
| l | 151645 | 14.7% |
| (space) | 142892 | 13.8% |
| y | 85652 | 8.3% |
| u | 73107 | 7.1% |
| r | 69785 | 6.8% |
| d | 55734 | 5.4% |
| p | 55734 | 5.4% |
| i | 55734 | 5.4% |
| a | 54228 | 5.2% |
| Other values (7) | 130347 | 12.6% |
no_people_benefited_by_water_body
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 278 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 681.37848 |
| Minimum | 1 |
|---|---|
| Maximum | 5000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 435.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 18 |
| Q3 | 100 |
| 95-th percentile | 681 |
| Maximum | 5000000 |
| Range | 4999999 |
| Interquartile range (IQR) | 94 |
Descriptive statistics
| Standard deviation | 29400.983 |
|---|---|
| Coefficient of variation (CV) | 43.149268 |
| Kurtosis | 16821.517 |
| Mean | 681.37848 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 114.81877 |
| Sum | 37975948 |
| Variance | 8.6441779 × 10⁸ |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 681 | 9184 | 16.5% |
| 10 | 8220 | 14.7% |
| 1 | 5020 | 9.0% |
| 5 | 4436 | 8.0% |
| 50 | 2954 | 5.3% |
| 20 | 2537 | 4.6% |
| 100 | 2147 | 3.9% |
| 4 | 2057 | 3.7% |
| 15 | 1772 | 3.2% |
| 30 | 1672 | 3.0% |
| Other values (268) | 15735 | 28.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 5020 | 9.0% |
| 2 | 1018 | 1.8% |
| 3 | 814 | 1.5% |
| 4 | 2057 | 3.7% |
| 5 | 4436 | 8.0% |
| 6 | 1377 | 2.5% |
| 7 | 677 | 1.2% |
| 8 | 959 | 1.7% |
| 9 | 233 | 0.4% |
| 10 | 8220 | 14.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5000000 | 1 | < 0.1% |
| 2500000 | 1 | < 0.1% |
| 2000000 | 2 | < 0.1% |
| 1100000 | 1 | < 0.1% |
| 1000000 | 2 | < 0.1% |
| 750000 | 1 | < 0.1% |
| 675000 | 1 | < 0.1% |
| 667800 | 1 | < 0.1% |
| 500000 | 8 | < 0.1% |
| 450000 | 2 | < 0.1% |
reason_water_body_in_use_name2
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 30662 |
| Missing (%) | 55.0% |
| Memory size | 435.6 KiB |
| Ground water recharge | 12843 |
|---|---|
| Domestic/Drinking | 5984 |
| Other | 1732 |
| Pisciculture | 1485 |
| Recreation | 1226 |
| Other values (3) | 1802 |
Length
| Max length | 21 |
|---|---|
| Median length | 21 |
| Mean length | 17.048141 |
| Min length | 5 |
Characters and Unicode
| Total characters | 427431 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ground water recharge |
|---|---|
| 2nd row | Ground water recharge |
| 3rd row | Ground water recharge |
| 4th row | Ground water recharge |
| 5th row | Ground water recharge |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Ground water recharge | 12843 | 23.0% |
| Domestic/Drinking | 5984 | 10.7% |
| Other | 1732 | 3.1% |
| Pisciculture | 1485 | 2.7% |
| Recreation | 1226 | 2.2% |
| Irrigation | 873 | 1.6% |
| Religious | 760 | 1.4% |
| Industrial | 169 | 0.3% |
| (Missing) | 30662 | 55.0% |
Plots omitted: Length, Common Values (Plot)
Words
| Value | Count | Frequency (%) |
|---|---|---|
| ground | 12843 | 25.3% |
| water | 12843 | 25.3% |
| recharge | 12843 | 25.3% |
| domestic/drinking | 5984 | 11.8% |
| other | 1732 | 3.4% |
| pisciculture | 1485 | 2.9% |
| recreation | 1226 | 2.4% |
| irrigation | 873 | 1.7% |
| religious | 760 | 1.5% |
| industrial | 169 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| r | 63714 | 14.9% |
| e | 50942 | 11.9% |
| a | 27954 | 6.5% |
| n | 27079 | 6.3% |
| (space) | 25686 | 6.0% |
| i | 25583 | 6.0% |
| t | 24312 | 5.7% |
| c | 23023 | 5.4% |
| o | 21686 | 5.1% |
| g | 20460 | 4.8% |
| Other values (15) | 116992 | 27.4% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 364705 | 85.3% |
| Uppercase Letter | 31056 | 7.3% |
| Space Separator | 25686 | 6.0% |
| Other Punctuation | 5984 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| r | 63714 | 17.5% |
| e | 50942 | 14.0% |
| a | 27954 | 7.7% |
| n | 27079 | 7.4% |
| i | 25583 | 7.0% |
| t | 24312 | 6.7% |
| c | 23023 | 6.3% |
| o | 21686 | 5.9% |
| g | 20460 | 5.6% |
| u | 16742 | 4.6% |
| Other values (7) | 63210 | 17.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| G | 12843 | 41.4% |
| D | 11968 | 38.5% |
| R | 1986 | 6.4% |
| O | 1732 | 5.6% |
| P | 1485 | 4.8% |
| I | 1042 | 3.4% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 25686 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| / | 5984 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 395761 | 92.6% |
| Common | 31670 | 7.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| r | 63714 | 16.1% |
| e | 50942 | 12.9% |
| a | 27954 | 7.1% |
| n | 27079 | 6.8% |
| i | 25583 | 6.5% |
| t | 24312 | 6.1% |
| c | 23023 | 5.8% |
| o | 21686 | 5.5% |
| g | 20460 | 5.2% |
| u | 16742 | 4.2% |
| Other values (13) | 94266 | 23.8% |
Common
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 25686 | 81.1% |
| / | 5984 | 18.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 427431 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| r | 63714 | 14.9% |
| e | 50942 | 11.9% |
| a | 27954 | 6.5% |
| n | 27079 | 6.3% |
| (space) | 25686 | 6.0% |
| i | 25583 | 6.0% |
| t | 24312 | 5.7% |
| c | 23023 | 5.4% |
| o | 21686 | 5.1% |
| g | 20460 | 4.8% |
| Other values (15) | 116992 | 27.4% |
reason_water_body_in_use_name3
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 43380 |
| Missing (%) | 77.8% |
| Memory size | 435.6 KiB |
| Ground water recharge | 4797 |
|---|---|
| Other | 4727 |
| Domestic/Drinking | 1054 |
| Irrigation | 892 |
| Recreation | 356 |
| Other values (3) | 528 |
Length
| Max length | 21 |
|---|---|
| Median length | 17 |
| Mean length | 12.994334 |
| Min length | 5 |
Characters and Unicode
| Total characters | 160532 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Pisciculture |
|---|---|
| 2nd row | Pisciculture |
| 3rd row | Pisciculture |
| 4th row | Pisciculture |
| 5th row | Pisciculture |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Ground water recharge | 4797 | 8.6% |
| Other | 4727 | 8.5% |
| Domestic/Drinking | 1054 | 1.9% |
| Irrigation | 892 | 1.6% |
| Recreation | 356 | 0.6% |
| Pisciculture | 318 | 0.6% |
| Religious | 154 | 0.3% |
| Industrial | 56 | 0.1% |
| (Missing) | 43380 | 77.8% |
Plots omitted: Length, Common Values (Plot)
Words
| Value | Count | Frequency (%) |
|---|---|---|
| ground | 4797 | 21.9% |
| water | 4797 | 21.9% |
| recharge | 4797 | 21.9% |
| other | 4727 | 21.5% |
| domestic/drinking | 1054 | 4.8% |
| irrigation | 892 | 4.1% |
| recreation | 356 | 1.6% |
| pisciculture | 318 | 1.4% |
| religious | 154 | 0.7% |
| industrial | 56 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| r | 27483 | 17.1% |
| e | 21356 | 13.3% |
| t | 12200 | 7.6% |
| a | 10898 | 6.8% |
| (space) | 9594 | 6.0% |
| h | 9524 | 5.9% |
| n | 8209 | 5.1% |
| o | 7253 | 4.5% |
| g | 6897 | 4.3% |
| c | 6843 | 4.3% |
| Other values (15) | 40275 | 25.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 136476 | 85.0% |
| Uppercase Letter | 13408 | 8.4% |
| Space Separator | 9594 | 6.0% |
| Other Punctuation | 1054 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| r | 27483 | 20.1% |
| e | 21356 | 15.6% |
| t | 12200 | 8.9% |
| a | 10898 | 8.0% |
| h | 9524 | 7.0% |
| n | 8209 | 6.0% |
| o | 7253 | 5.3% |
| g | 6897 | 5.1% |
| c | 6843 | 5.0% |
| i | 6302 | 4.6% |
| Other values (7) | 19511 | 14.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| G | 4797 | 35.8% |
| O | 4727 | 35.3% |
| D | 2108 | 15.7% |
| I | 948 | 7.1% |
| R | 510 | 3.8% |
| P | 318 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 9594 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| / | 1054 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 149884 | 93.4% |
| Common | 10648 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| r | 27483 | 18.3% |
| e | 21356 | 14.2% |
| t | 12200 | 8.1% |
| a | 10898 | 7.3% |
| h | 9524 | 6.4% |
| n | 8209 | 5.5% |
| o | 7253 | 4.8% |
| g | 6897 | 4.6% |
| c | 6843 | 4.6% |
| i | 6302 | 4.2% |
| Other values (13) | 32919 | 22.0% |
Common
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 9594 | 90.1% |
| / | 1054 | 9.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 160532 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| r | 27483 | 17.1% |
| e | 21356 | 13.3% |
| t | 12200 | 7.6% |
| a | 10898 | 6.8% |
| (space) | 9594 | 6.0% |
| h | 9524 | 5.9% |
| n | 8209 | 5.1% |
| o | 7253 | 4.5% |
| g | 6897 | 4.3% |
| c | 6843 | 4.3% |
| Other values (15) | 40275 | 25.1% |
| | Area_Type | District Name | Original_Storage_Capacity | Present_Storage_Capacity | Reason_for_Water_Body_Use | Renovation_Year | Repair_Renovation_Status | Scheme_Status_Reason | Water_Body_Nature | Water_Body_Status | Water_Body_Type | construcion_year | construction_cost | filled_up_storage_name | filled_up_storage_space_name | no_people_benefited_by_water_body | reason_water_body_in_use_name2 | reason_water_body_in_use_name3 | renovation_cost |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Area_Type | 1.000 | 0.147 | 0.049 | 0.047 | 0.137 | 0.027 | 0.026 | 0.037 | 0.052 | 0.016 | 0.020 | -0.052 | -0.027 | 0.026 | 0.024 | 0.062 | 0.075 | 0.098 | 0.082 |
| District Name | 0.147 | 1.000 | 0.166 | 0.167 | 0.167 | -0.062 | 0.230 | 0.086 | 0.293 | 0.130 | 0.116 | 0.122 | 0.064 | 0.170 | 0.200 | 0.026 | 0.137 | 0.176 | 0.140 |
| Original_Storage_Capacity | 0.049 | 0.166 | 1.000 | 0.977 | 0.011 | -0.033 | 0.000 | 0.000 | 0.004 | 0.000 | 0.178 | -0.065 | 0.202 | 0.000 | 0.000 | 0.245 | 0.000 | 0.000 | 0.274 |
| Present_Storage_Capacity | 0.047 | 0.167 | 0.977 | 1.000 | 0.016 | -0.024 | 0.000 | 0.000 | 0.000 | 0.000 | 0.149 | -0.046 | 0.210 | 0.000 | 0.000 | 0.225 | 0.000 | 0.000 | 0.271 |
| Reason_for_Water_Body_Use | 0.137 | 0.167 | 0.011 | 0.016 | 1.000 | -0.044 | 0.073 | 0.378 | 0.039 | 1.000 | 0.062 | -0.063 | -0.047 | 0.068 | 0.057 | 0.222 | 0.227 | 0.206 | -0.006 |
| Renovation_Year | 0.027 | -0.062 | -0.033 | -0.024 | -0.044 | 1.000 | 0.042 | 0.079 | 0.050 | 0.130 | 0.036 | 0.294 | 0.192 | 0.045 | 0.043 | -0.134 | 0.036 | 0.061 | 0.045 |
| Repair_Renovation_Status | 0.026 | 0.230 | 0.000 | 0.000 | 0.073 | 0.042 | 1.000 | 0.018 | 0.010 | 0.019 | 0.042 | 0.098 | 0.085 | 0.009 | 0.029 | 0.008 | 0.029 | 0.024 | 0.065 |
| Scheme_Status_Reason | 0.037 | 0.086 | 0.000 | 0.000 | 0.378 | 0.079 | 0.018 | 1.000 | 0.025 | 1.000 | 0.039 | -0.046 | -0.048 | 0.088 | 0.071 | 0.328 | 1.000 | 1.000 | -0.072 |
| Water_Body_Nature | 0.052 | 0.293 | 0.004 | 0.000 | 0.039 | 0.050 | 0.010 | 0.025 | 1.000 | 0.012 | 0.035 | 0.033 | 0.033 | 0.052 | 0.050 | 0.037 | 0.054 | 0.044 | 0.008 |
| Water_Body_Status | 0.016 | 0.130 | 0.000 | 0.000 | 1.000 | 0.130 | 0.019 | 1.000 | 0.012 | 1.000 | 0.046 | 0.077 | 0.041 | 0.123 | 0.073 | -0.632 | 1.000 | 1.000 | 0.119 |
| Water_Body_Type | 0.020 | 0.116 | 0.178 | 0.149 | 0.062 | 0.036 | 0.042 | 0.039 | 0.035 | 0.046 | 1.000 | 0.119 | 0.284 | 0.111 | 0.138 | 0.125 | 0.047 | 0.041 | 0.099 |
| construcion_year | -0.052 | 0.122 | -0.065 | -0.046 | -0.063 | 0.294 | 0.098 | -0.046 | 0.033 | 0.077 | 0.119 | 1.000 | 0.522 | 0.040 | 0.037 | -0.105 | 0.062 | 0.045 | -0.099 |
| construction_cost | -0.027 | 0.064 | 0.202 | 0.210 | -0.047 | 0.192 | 0.085 | -0.048 | 0.033 | 0.041 | 0.284 | 0.522 | 1.000 | 0.000 | 0.000 | 0.127 | 0.000 | 0.000 | 0.159 |
| filled_up_storage_name | 0.026 | 0.170 | 0.000 | 0.000 | 0.068 | 0.045 | 0.009 | 0.088 | 0.052 | 0.123 | 0.111 | 0.040 | 0.000 | 1.000 | 0.452 | -0.000 | 0.037 | 0.067 | 0.103 |
| filled_up_storage_space_name | 0.024 | 0.200 | 0.000 | 0.000 | 0.057 | 0.043 | 0.029 | 0.071 | 0.050 | 0.073 | 0.138 | 0.037 | 0.000 | 0.452 | 1.000 | -0.005 | 0.029 | 0.067 | 0.040 |
| no_people_benefited_by_water_body | 0.062 | 0.026 | 0.245 | 0.225 | 0.222 | -0.134 | 0.008 | 0.328 | 0.037 | -0.632 | 0.125 | -0.105 | 0.127 | -0.000 | -0.005 | 1.000 | 0.000 | 0.000 | 0.162 |
| reason_water_body_in_use_name2 | 0.075 | 0.137 | 0.000 | 0.000 | 0.227 | 0.036 | 0.029 | 1.000 | 0.054 | 1.000 | 0.047 | 0.062 | 0.000 | 0.037 | 0.029 | 0.000 | 1.000 | 0.286 | 0.033 |
| reason_water_body_in_use_name3 | 0.098 | 0.176 | 0.000 | 0.000 | 0.206 | 0.061 | 0.024 | 1.000 | 0.044 | 1.000 | 0.041 | 0.045 | 0.000 | 0.067 | 0.067 | 0.000 | 0.286 | 1.000 | 0.048 |
| renovation_cost | 0.082 | 0.140 | 0.274 | 0.271 | -0.006 | 0.045 | 0.065 | -0.072 | 0.008 | 0.119 | 0.099 | -0.099 | 0.159 | 0.103 | 0.040 | 0.162 | 0.033 | 0.048 | 1.000 |
| | Area_Type | State Name | District Name | Water_Body_Type | Water_Body_Status | Reason_for_Water_Body_Use | Scheme_Status_Reason | Water_Body_Nature | construcion_year | construction_cost | Renovation_Year | renovation_cost | Repair_Renovation_Status | Original_Storage_Capacity | Present_Storage_Capacity | filled_up_storage_name | filled_up_storage_space_name | no_people_benefited_by_water_body | reason_water_body_in_use_name2 | reason_water_body_in_use_name3 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Rural | KERALA | Kollam | Ponds | Yes | Domestic/Drinking | No reported problems | Natural | 1989.0 | 25000.0 | 2016.0 | 6000.0 | No | 60.0 | 50.0 | Full | Filled up every year | 1 | Ground water recharge | Pisciculture |
| 1 | Rural | KERALA | Kollam | Ponds | Yes | Domestic/Drinking | No reported problems | Natural | NaN | NaN | 2015.0 | 5000.0 | No | 60.0 | 50.0 | Full | Filled up every year | 13 | Ground water recharge | Pisciculture |
| 2 | Rural | KERALA | Kollam | Ponds | No | Not Specified | Siltation | Natural | NaN | NaN | 2014.0 | 5700.0 | No | 240.0 | 150.0 | Full | Filled up every year | 681 | NaN | NaN |
| 3 | Rural | KERALA | Kollam | Ponds | Yes | Domestic/Drinking | No reported problems | Natural | NaN | NaN | 2015.0 | 5000.0 | No | 30.0 | 26.0 | Full | Filled up every year | 12 | Ground water recharge | Pisciculture |
| 4 | Rural | KERALA | Palakkad | Ponds | No | Not Specified | Others | Natural | NaN | NaN | 2002.0 | 80000.0 | No | 2830.0 | 2000.0 | Upto 3/4 | Usually filled up | 681 | NaN | NaN |
| 5 | Rural | KERALA | Kollam | Ponds | No | Not Specified | Siltation | Natural | NaN | NaN | 2017.0 | 5000.0 | No | 240.0 | 180.0 | Full | Filled up every year | 681 | NaN | NaN |
| 6 | Rural | KERALA | Kollam | Ponds | No | Not Specified | Siltation | Natural | NaN | NaN | 2015.0 | 5000.0 | No | 120.0 | 100.0 | Upto 3/4 | Filled up every year | 681 | NaN | NaN |
| 7 | Rural | KERALA | Kollam | Ponds | Yes | Domestic/Drinking | No reported problems | Natural | NaN | NaN | 2016.0 | 6000.0 | No | 60.0 | 50.0 | Full | Filled up every year | 1 | Ground water recharge | Pisciculture |
| 8 | Rural | KERALA | Kollam | Ponds | Yes | Domestic/Drinking | No reported problems | Natural | NaN | NaN | 2014.0 | 3000.0 | No | 7500.0 | 5250.0 | Full | Filled up every year | 40 | Ground water recharge | Pisciculture |
| 9 | Rural | KERALA | Kollam | Ponds | Yes | Domestic/Drinking | No reported problems | Natural | NaN | NaN | 2010.0 | 4000.0 | No | 60.0 | 50.0 | Full | Filled up every year | 13 | Ground water recharge | Pisciculture |
| | Area_Type | State Name | District Name | Water_Body_Type | Water_Body_Status | Reason_for_Water_Body_Use | Scheme_Status_Reason | Water_Body_Nature | construcion_year | construction_cost | Renovation_Year | renovation_cost | Repair_Renovation_Status | Original_Storage_Capacity | Present_Storage_Capacity | filled_up_storage_name | filled_up_storage_space_name | no_people_benefited_by_water_body | reason_water_body_in_use_name2 | reason_water_body_in_use_name3 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 55724 | Rural | KERALA | Kasargod | Ponds | Yes | Irrigation | No reported problems | Man-made | 1970.0 | 10000.0 | 2015.0 | 10000.0 | No | 4320.000000 | 4320.000000 | Full | Filled up every year | 15 | Ground water recharge | NaN |
| 55725 | Rural | KERALA | Kozhikode | Water consv schemes/percolation tanks/check-dams | Yes | Ground water recharge | No reported problems | Man-made | NaN | NaN | 2012.0 | 1000000.0 | No | 172467.460633 | 142922.561496 | Full | Filled up every year | 12 | NaN | NaN |
| 55726 | Rural | KERALA | Kozhikode | Ponds | Yes | Ground water recharge | No reported problems | Man-made | 1991.0 | 40000.0 | 2005.0 | 10000.0 | No | 144.000000 | 48.000000 | Upto 1/4 | Rarely filled up | 6 | NaN | NaN |
| 55727 | Rural | KERALA | Kozhikode | Ponds | Yes | Religious | No reported problems | Man-made | 1983.0 | 20000.0 | 2004.0 | 40000.0 | No | 160.000000 | 100.000000 | Upto 3/4 | Usually filled up | 25 | Ground water recharge | NaN |
| 55728 | Rural | KERALA | Kozhikode | Ponds | Yes | Irrigation | No reported problems | Man-made | 1981.0 | 10000.0 | 2017.0 | 5000.0 | No | 64.000000 | 32.000000 | Upto 1/2 | Rarely filled up | 10 | Ground water recharge | NaN |
| 55729 | Rural | KERALA | Kozhikode | Ponds | Yes | Ground water recharge | No reported problems | Man-made | 1969.0 | 2000.0 | 2012.0 | 5000.0 | No | 1200.000000 | 600.000000 | Upto 1/2 | Rarely filled up | 100 | NaN | NaN |
| 55730 | Rural | KERALA | Kozhikode | Ponds | Yes | Pisciculture | No reported problems | Man-made | 1988.0 | 10000.0 | 2006.0 | 6000.0 | No | 540.000000 | 450.000000 | Upto 3/4 | Usually filled up | 10 | Ground water recharge | NaN |
| 55731 | Rural | KERALA | Idukki | Ponds | Yes | Irrigation | No reported problems | Man-made | 2003.0 | 70000.0 | 2017.0 | 10000.0 | No | 210.000000 | 210.000000 | Full | Filled up every year | 10 | Domestic/Drinking | NaN |
| 55732 | Urban | KERALA | Kollam | Ponds | No | Not Specified | Others | Man-made | NaN | NaN | NaN | NaN | No | 20.000000 | 20.000000 | Upto 3/4 | Rarely filled up | 681 | NaN | NaN |
| 55733 | Rural | KERALA | Malappuram | Ponds | Yes | Domestic/Drinking | No reported problems | Man-made | 1990.0 | 20000.0 | NaN | NaN | No | 1400.000000 | 1125.000000 | Full | Filled up every year | 100 | NaN | NaN |
Most frequently occurring
| | Area_Type | State Name | District Name | Water_Body_Type | Water_Body_Status | Reason_for_Water_Body_Use | Scheme_Status_Reason | Water_Body_Nature | construcion_year | construction_cost | Renovation_Year | renovation_cost | Repair_Renovation_Status | Original_Storage_Capacity | Present_Storage_Capacity | filled_up_storage_name | filled_up_storage_space_name | no_people_benefited_by_water_body | reason_water_body_in_use_name2 | reason_water_body_in_use_name3 | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 99 | Rural | KERALA | Idukki | Water consv schemes/percolation tanks/check-dams | Yes | Domestic/Drinking | No reported problems | Man-made | NaN | NaN | NaN | NaN | No | 172467.460633 | 142922.561496 | Full | Filled up every year | 10 | NaN | NaN | 20 |
| 469 | Rural | KERALA | Thrissur | Ponds | Yes | Domestic/Drinking | No reported problems | Natural | NaN | NaN | NaN | NaN | No | 100.000000 | 90.000000 | Full | Filled up every year | 10 | NaN | NaN | 12 |
| 355 | Rural | KERALA | Palakkad | Ponds | Yes | Irrigation | No reported problems | Natural | NaN | NaN | NaN | NaN | No | 150.000000 | 150.000000 | Upto 3/4 | Usually filled up | 15 | NaN | NaN | 11 |
| 313 | Rural | KERALA | Palakkad | Ponds | No | Not Specified | Siltation | Natural | NaN | NaN | NaN | NaN | No | 300.000000 | 200.000000 | Upto 3/4 | Usually filled up | 681 | NaN | NaN | 10 |
| 247 | Rural | KERALA | Malappuram | Ponds | Yes | Other | No reported problems | Man-made | NaN | NaN | NaN | NaN | No | 200.000000 | 150.000000 | Full | Filled up every year | 1 | Ground water recharge | Irrigation | 8 |
| 292 | Rural | KERALA | Palakkad | Ponds | No | Not Specified | Others | Natural | NaN | NaN | NaN | NaN | No | 200.000000 | 150.000000 | Full | Filled up every year | 681 | NaN | NaN | 8 |
| 359 | Rural | KERALA | Palakkad | Ponds | Yes | Irrigation | No reported problems | Natural | NaN | NaN | NaN | NaN | No | 200.000000 | 150.000000 | Full | Filled up every year | 1 | Other | NaN | 8 |
| 9 | Rural | KERALA | Alappuzha | Ponds | No | Not Specified | Siltation | Man-made | NaN | NaN | NaN | NaN | No | 60.000000 | 50.000000 | Full | Filled up every year | 681 | NaN | NaN | 7 |
| 17 | Rural | KERALA | Alappuzha | Ponds | No | Not Specified | Siltation | Man-made | NaN | NaN | NaN | NaN | No | 200.000000 | 150.000000 | Upto 1/2 | Usually filled up | 681 | NaN | NaN | 7 |
| 217 | Rural | KERALA | Kozhikode | Water consv schemes/percolation tanks/check-dams | No | Not Specified | Others | Man-made | NaN | NaN | NaN | NaN | No | 172467.460633 | 142922.561496 | Full | Filled up every year | 681 | NaN | NaN | 7 |